Amendments to the Claims:

This listing of claims will replace all prior versions and listings of claims in the application: 

1. (Currently amended) A method for determining the similarity of data records
in first and second data sets, the data records having an informational content, the method 
comprising: 

receiving a first data set from a first data source, the first data set including a first 
number of data records;

receiving a second data set from a second data source, the second data set having a 
second number of data records;

identifying a first data record in the first data set that is potentially identical to a second 
data record in the second data set and determining a similarity level between the first data 
record and the second data record, the identified first and second data records having an
informational content that is non-identical but similar; 

determining whether the first and second data records already identified as potentially 
identical are truly identical based at least in part upon [[a predetermined criteria]] the similarity
level and without reducing the first number of records or the second number of records.

2. (Original) The method of claim 1 wherein identifying a first and second data 
records identifies telecommunication call detail records (CDRs). 

3. (Original) The method of claim 1 wherein identifying a first data record and
a second data record includes grouping the records in the first and second data sets into groups 
based upon a predetermined criteria. 

4. (Original) The method of claim 1 wherein identifying includes comparing the 
informational content of first data record to the informational content of the second data record. 

5. (Currently amended) A method for determining different data records in a 
telecommunications system from records in first and second data sets comprising:

receiving a first data set from a first data source, the first data set including a first 
number of data records;

receiving a second data set from a second data source, the second data set having a
second number of data records; 

[[identifying]] determining potentially different data records in the first data set at least in
part by comparing records in the first data set to records in the second data set wherein the
compared records have a level of similarity therebetween, and wherein the determining is
achieved without reducing the first number of records or the second number of records; and

verifying that the potentially different records already identified as potentially different 
are truly different at least in part by using [[at least one predetermined criteria]] the level of
similarity between the potentially different data records.

6. (Original) The method of claim 5 wherein the different data records can be 
faulty data records or mismatched data records. 

7. (Original) The method of claim 5 wherein determining potentially different 
data records includes defining a set of similarity characteristics, grouping the data records in 
each of the first and second sets according to the similarity characteristics into similarity groups, 
and comparing the similarity groups in the first data set to the similarity groups in the second 
data set. 

8. (Original) The method of claim 5 wherein determining potentially different
data records includes determining whether each record in the first data set completely matches 
with a data record in the second data set. 

9. (Original) The method of claim 5 wherein verifying includes determining 
from the second data set a set of data records that are similar to the potentially different record 
identified in the first data set. 

10. (Original) The method of claim 5 wherein verifying includes scoring 
elements of each of the plurality of data records to form a plurality of scores, multiplying the 
plurality of scores to form a test score, comparing the test score to a predetermined minimum 
score, and determining a different record if the comparison determines the test score is 
unacceptable. 

11. (Original) The method of claim 5 further comprising taking an action relating
to the different data records. 

12. (Currently amended) A device for determining faulty data records in a 
telecommunications system from records in first and second data sets comprising:

a data store arranged and configured to store [[containing]] first and second data sets; and
a processor coupled to the data store and having an output, the processor arranged and
configured to receive a first data set from a first data source, the first data set including a first 
number of data records and to receive a second data set from a second data source, the second 
data set having a second number of data records, the processor storing the first and second data 
sets in the data store,

such that the processor identifies potentially different data records in the first data set at 
least in part by comparing records in the first data set to records in the second data set wherein 
the compared records have a level of similarity therebetween and verifies that the potentially 
different records already identified as potentially different are different at least in part by using 
[[at least one predetermined criteria]] the similarity level and without reducing the first number of
records or the second number of records, and such that the processor identifies the different 
records on the output. 

13. (Original) The device of claim 12 wherein the processor includes means for
defining a set of similarity characteristics, means for grouping the data records in each of the 
first and second sets according to the similarity characteristics into similarity groups, and 
means for comparing the similarity groups in the first data set to the similarity groups in the 
second data set. 

14. (Original) The device of claim 12 wherein the processor includes means for 
determining whether each record in the first data set completely matches with a data record in 
the second data set. 

15. (Original) The device of claim 12 wherein the processor includes means for
determining from the second data set a set of data records that are similar to the potentially 
faulty record identified in the first data set. 

16. (Original) The device of claim 12 wherein the processor includes means for
scoring elements of each of the plurality of data records to form a plurality of scores, means for 
multiplying the plurality of scores to form a test score, means for comparing the test score to a 
predetermined minimum score, and means for determining a different record if the comparison 
determines the test score is unacceptable. 

17. (Original) The device of claim 12 wherein the different data records can be
faulty data records or mismatched data records. 